Searching XML Databases for Semantically-related Schemas

نویسندگان

  • Gauri Shah
  • Tanveer Syeda-Mahmood
چکیده

In this paper, we address the problem of searching schema databases for semantically-related schemas. We first give a method of finding semantic similarity between pair-wise schemas based on tokenization, part-of-speech tagging, word expansion, and ontology matching. We then address the problem of indexing the schema database through a semantic hash table. Matching schemas in the database are found by hashing the query attributes and recording peaks in the histogram of schema hits. Results indicated a 90% improvement in search performance while maintaining high precision and recall.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Approach for Clustering Semantically Heterogeneous XML Schemas

In this paper we illustrate an approach for clustering semantically heterogeneous XML Schemas. The proposed approach is driven mainly by the semantics of the involved Schemas that is defined by means of the interschema properties existing among concepts represented therein. An important feature of our approach consists of its capability to be integrated with almost all the clustering algorithms...

متن کامل

Towards Semantic Integration of XML-based Business Process Models

This paper discusses the applicability of schema integration methodology for the integration of XML Schemas for business process modelling. This methodology builds upon the assumption that the integrated schema has to support queries and updates on all underlying local schemas. The heterogeneous schemas of Business Process Execution Language for Web Services (BPEL4WS) and Petri Net Markup Langu...

متن کامل

An Approach to Extracting Sub-schema Similarities from Semantically Heterogeneous XML Schemas

This paper presents a semi-automatic approach to deriving sub-schema similarities from semantically heterogeneous XML Schemas. The proposed approach is specific for XML, almost automatic and light. It consists of two phases: the first phase selects the most promising pairs of sub-schemas, the second one examines them and returns only those which are similar. This paper describes the approach in...

متن کامل

Mapping DTDs to relational schemas with semantic constraints

XML is becoming a prevalent format and standard for data exchange in many applications. With the increase of XML data, there is an urgent need to research some efficient methods to store and manage XML data. As relational databases are the primary choices for this purpose considering their data management power, it is necessary to research the problem of mapping XML schemas to relational schema...

متن کامل

Ontology-based heterogeneous XML data integration

In this paper we present an ontology-based method for formalizing the implicit semantic and we suggest mechanisms to semantically integrate XML schemas and documents as well. After a survey of database interoperability, we present our semantic integration approach by explaining the nature of ontology. The article then presents our integration method for XML data and schemas using a generic onto...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004